Extracting Comprehensible Models from Trained Neural Networks
نویسنده
چکیده
Although neural networks have been used to develop highly accurate classifiers in numerous real-world problem domains, the models they learn are notoriously difficult to understand. This thesis investigates the task of extracting comprehensible models from trained neural networks, thereby alleviating this limitation. The primary contribution of the thesis is an algorithm that overcomes the significant limitations of previous methods by taking a novel approach to the task of extracting comprehensible models from trained networks. This algorithm, called Trepan, views the task as an inductive learning problem. Given a trained network, or any other learned model, Trepan uses queries to induce a decision tree that approximates the function represented by the model. Unlike previous work in this area, Trepan is broadly applicable as well as scalable to large networks and problems with high-dimensional input spaces. The thesis presents experiments that evaluate Trepan by applying it to individual networks and to ensembles of neural networks trained in classification, regression, and reinforcement-learning domains. These experiments demonstrate that Trepan is able to extract decision trees that are comprehensible, yet maintain high levels of fidelity to their respective networks. In problem domains in which neural networks provide superior predictive accuracy to conventional decision tree algorithms, the trees extracted by Trepan also exhibit superior accuracy, but are comparable in terms of complexity, to the trees learned directly from the training data. A secondary contribution of this thesis is an algorithm, called BBP, that constructively induces simple neural networks. The motivation underlying this algorithm is similar to that for Trepan: to learn comprehensible models in problem domains in which neural networks have an especially appropriate inductive bias. The BBP algorithm, which is based on a hypothesis-boosting method, learns perceptrons that have relatively few connections. This algorithm provides an appealing combination of strengths: it provides learnability guarantees for a fairly natural class of target functions; it provides good predictive accuracy in a variety of problem domains; and it constructs syntactically simple models, thereby facilitating human comprehension of what it has learned. These algorithms provide mechanisms for improving the understanding of what a trained neural network has learned.
منابع مشابه
Knowledge Extraction from the Neural ‘Black Box’ in Ecological Monitoring
Phytoplankton biomass within the Saginaw Bay ecosystem (Lake Huron, Michigan, USA) was characterized as a function of select physical/chemical indicators. The complexity and variability of ecological systems typically make it difficult to model the influences of anthropogenic stressors and/or natural disturbances. Here, Artificial Neural Networks (ANNs) were developed to model chlorophyll a con...
متن کاملAn Overview of Research at Wisconsin on Knowledge-Based Neural Networks
Recent research at the University of Wisconsin on knowledge-based neural networks is surveyed. This work has focused on (a) using symbolically represented background knowledge to improve neural-network learning and (b) extracting comprehensible symbolic representations from trained networks. Important open issues are discussed.
متن کاملExtracting Thee-Structured Representations of Thained Networks
A significant limitation of neural networks is that the representations they learn are usually incomprehensible to humans. We present a novel algorithm, TREPAN, for extracting comprehensible , symbolic representations from trained neural networks. Our algorithm uses queries to induce a decision tree that approximates the concept represented by a given network. Our experiments demonstrate that T...
متن کاملExtracting Fuzzy Symbolic Representation from Artificial Neural Networks
This paper presents FUZZYTREPAN, a pedagogical approach to the problem of extracting comprehensible symbolic knowledge from trained Artificial Neural Networks. This approach extends the previously proposed TREPAN method in two ways: it uses fuzzy representation in its knowledge-extraction process (by means of Fuzzy Decision Trees), and it uses additional heuristics in its process of generating ...
متن کاملExtracting Tree-Structured Representations of Trained Networks
A signiicant limitation of neural networks is that the representations they learn are usually incomprehensible to humans. We present a novel algorithm, Trepan, for extracting comprehensible, symbolic representations from trained neural networks. Our algorithm uses queries to induce a decision tree that approximates the concept represented by a given network. Our experiments demonstrate that Tre...
متن کامل